Traveling Speculations: An Integrated Prediction Strategy for Wide-Issue Microprocessors

نویسندگان

  • Ravi Bhargava
  • Juan Rubio
  • Lizy K. John
چکیده

Performing multiple, accurate, low-latency predictions is crucial to improving instruction throughput in future wide-issue microprocessors. However, demands of wide-issue processing coupled with implementation challenges posed by high clock frequencies present obstacles to these prediction goals. This paper proposes the Traveling Speculation framework to accommodate predictions in a wide-issue environment. Instead of using large centralized data tables to perform predictions, distributed predictors are introduced along with the Traveling Speculation hint, or tint. A tint contains execution history and is assigned to each instruction that is in-flight or in the trace cache. The tint can be related to branch behavior, cache behavior, value prediction, or any history-based aspect of microarchitecture. By associating a unique tint with each instruction in the trace cache, a per-instruction, interference-free execution history is maintained, providing superior prediction accuracy and instruction coverage in a low-latency manner. Our results show that the Traveling Speculation framework improves prediction accuracy and provides personalized execution history to a high percentage of dynamic instructions. Value prediction using Traveling Speculation tints improves instruction throughput by 9.7% over the base model, in comparison to 5.0% obtained by a table-based value predictor. Similarly, speculative memory forwarding using tints improves instruction throughput by 11.8% compared to 7.3% obtained using a memory renaming scheme.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Numerical Prediction of Stator Diameter Effect on the Output Torque of Ultrasonic Traveling-wave Motor, using Finite Elements Simulation

Nowadays, piezoelectric materials have wide applications in various industries. Therefore, investigation of these materials and their applications has a special importance. In this paper first, the natural frequencies of a traveling-wave piezoelectric motor are achieved, using finite elements simulations. Then, applying an alternative electrical voltage to the piezoelectric ring, a traveling wa...

متن کامل

Value Prediction Design for High-Frequency Microprocessors

This paper studies value predictor design under table access latency and energy constraints for high-frequency, wide-issue microprocessors. Previous value prediction efforts make generous assumptions regarding table sizes and access conditions, while ignoring prediction latencies and energy issues. Our work shows that the latency of a high-performance value predictor cannot be completely hidden...

متن کامل

New Heuristic Algorithms for Solving Single-Vehicle and Multi-Vehicle Generalized Traveling Salesman Problems (GTSP)

Among numerous NP-hard problems, the Traveling Salesman Problem (TSP) has been one of the most explored, yet unknown one. Even a minor modification changes the problem’s status, calling for a different solution. The Generalized Traveling Salesman Problem (GTSP)expands the TSP to a much more complicated form, replacing single nodes with a group or cluster of nodes, where the objective is to fi...

متن کامل

Multiple Branch Prediction for Wide - Issue Superscalar ∗

Modern micro-architectures employ superscalar techniques to enhance system performance. Since the superscalar microprocessors must fetch at least one instruction cache line at a time to support high issue rate and large amount speculative executions. There are cases that multiple branches are often encountered in one cycle. And in practical implementation this would cause serious problem while ...

متن کامل

Automatic Abstraction of Memories in the Formal Verification of Superscalar Microprocessors

A system of conservative transformation rules is presented for abstracting memories whose forwarding logic interacts with stalling conditions for preserving the memory semantics in microprocessors with in-order execution. Microprocessor correctness is expressed in the logic of Equality with Uninterpreted Functions and Memories (EUFM) [6]. Memory reads and writes are abstracted as arbitrary unin...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2002